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Abstract 

The paper introduces a limit version of multiple stopping options such that the holder 
selects dynamically a weight function that control the distribution of the payments (benefits) 
over time. In applications for commodities and energy trading, a control process can represent 
the quantity that can be purchased by a fixed price at current time. In another example, 
the control represents the weight of the integral in a modification of the Asian option. The 
pricing for these options requires to solve a stochastic control problem. Some existence 
results and pricing rules are obtained via modifications of parabolic Bellman equations. 
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1 Introduction 

There are many different types of options and financial derivatives: European, American, Asian, 
Bermudian, Israeli, Russian, Parisian options, etc. (see, e.g., Briys et al (1998)). Pricing of 
exotic options require special methods; see, e.g., Kifer (2000), Kyprianou (2004), Kramkov et 
al (1994), Meinshausen and Hambly (2004), Peskir (2005), Bender and Schoenmakers (2006), 
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Bender (2011), Carmona and Dayanik (2006), Carmona and Touzi (2006), Dai and Kwok (2008), 
Dokuchaev (2009). 

Typically, new type of options are developed with the purpose to offer some additional 
flexibility for an option holder. In particular, multiple exercise options are used in energy 
trading; allow a holder to distribute the purchase of energy by a fixed price over a set of 
time moments; see, e.g., Meinshausen and Hambly (2004), Bender and Schoenmakers (2006), 
Bender (2011), Carmona and Dayanik (2006), Carmona and Touzi (2006), Dai and Kwok (2008), 
Dokuchaev (2009), and Bender (2011). The paper suggests a next step in this direction. We 
develop a family of options that allows the holder to select dynamically continuous time processes 
that control the payoff. We call the new options controlled options. The control processes are 
assumed to be adapted to the current flow of information. More precisely, the holders of the new 
options select dynamically the weight functions that control the distribution of the payments 
(benefits) over time. There is a similarity with passport options introduced in Hyer et al (1997) 
as a generalization of the American option (see also Delbaen and Yor (2002), Kampen (2008), 
Nagayama (1999)). The passport options allow the holder to select investment strategies for 
an account; the writer guarantees protection from the losses. The difference with the control 
options introduced in this paper is that the holder of passport options selects portfolio strategy. 

The controlled options may have applications in commodities and energy trading. For in- 
stance, control process u(t) may represent the weight of the integral in a modification of the 
Asian option. In another example, a non-negative control process u(t) can represent the amount 
of some commodity that can be purchased by a certain given price at time t € [0, T], where T is 
the terminal time, given that J u(t)dt = 1. This is a limit case of multi-exercise option studied 
in Bender and Schoenmakers (2006) and Bender (2011), where the distribution of exercise times 
approaching a continuous distribution. Therefore, controlled options can be used also as an 
auxiliary tool to study these multi-exercise options. In some cases, analysis of these controlled 
option is more straightforward since optimal multi-stopping is actually excluded; it is replaced 
by more standard stochastic control problem. These and other examples of controlled options 
are studied below. It is shown that pricing for these options requires solution of a stochastic 
control problem rather than optimal stopping problem. Some existence results pricing rules are 
obtained in Markov diffusion setting based on dynamic programming and various modifications 
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of degenerate parabolic Bellman equations (Hamilton- Jacobi-Bellman (HJB) equation). 

The paper is organized as follows. In Section [2j two classes of controlled options are in- 
troduced: (i) options where the adapted weight u(t) is selected such that L u(t)dt = 1, and 
(ii) options where the weight u(t) does not restriction on its cumulate, and where the payoff is 
defined by the normalized weight v(t) = ^ J Q T u(s)ds^j which is not adapted. Some motivation 
for this setting is given. In Section [3l the market model is introduced. In Section [H the general 
martingale pricing formula is given. In Section [5j the pricing is discussed for the case (i). In 
Section [H the pricing is discussed for the case (ii). The proofs are given in Appendix. 



2 Controlled options: definition and examples 

Consider a risky asset (stock, commodity, a unit of energy) with the price S(t), where t G [0, T], 
for a given T > 0. Consider an option with the payoff 

F„ = *(«(•), (2-1) 

This payoff depends on a control process u(-) that is selected by an option holder from a certain 
class of admissible controls U. The mapping $ : U x S — > R is given; S is the set of paths of 
S(t). All processes from U has to be adapted to the current information flow, i.e., adapted to 
some filtration Tt that describes this information flow. 

We call the corresponding options controlled options. Clearly, an American option is a 
special case of controlled options, where the exercise time is selected. Some new examples of 
controlled options are suggested and discussed below. 

For simplicity, we assume that all options give the right on the corresponding payoff of the 
amount F u in cash rather than the right to buy or sell stock or commodities. 

Options with adapted weight with fixed cumulated integral 

Consider a risky asset with the price S(t). Let T > be given, and let g : R — > R and 
/ : R x [0, T] — > R be some functions. Consider an option with the payoff at time T 

F u = g(£u(t)f(S(t),t)dt\, (2.2) 



3 



Here u(t) is the control process that is selected by the option holder. The process u(t) has to be 
adapted to the filtration F t describing the information flow. In addition, it has to be selected 
such that 

f u{t)dt = 1. 
Jo 

A possible modification is the option with the payoff 

F U = F U = J u{t)f(S(t),t)dt+(l- J u(t)dt\ f(S(T),T). 

In this case, the unused u(t) are accumulated and used at the terminal time. 
Let us consider some examples of possible selection of / and g. 
We denote x + = max(0, x). 

Important special cases are the options with g(x) = x, g(x) = (x — k) + , g(x) = (K — x) + , 
g(x) = min(M, x), where M > is the cap for benefits, and with 

f(x,t) = x, f(x,t) = (x-K)+, f(x,t) = (K-x) + , (2.3) 

or 

f(x,t) = e^ T - l \x - K)+, f(x,t) = e r{ ?- l \K - x)+, (2.4) 

where K > is given and where r > is the risk-free rate. 

Options (12.3P correspond to the case when the payments are made at current time t € [0, T], 
and options (j2.4|) correspond to the case when the payment is made at terminal time T. This 
model takes into account accumulation of interest up to time T on any payoff. 

The option with payoff (12. 2D with f(x,t) = x represents a generalization of Asian option 
where the weight u(t) is selected by the holder. 

The option with payoff (|2.2p with g(x) = x represents a limit version of the multi-exercise 
options, when the distribution of exercise time approaches a continuous distribution. An ad- 
ditional restriction on \u(t)\ < const would represent the continuous analog of the requirement 
for multi-exercise options that exercise times must be on some distance from each other. For an 
analog of the model without this condition, strategies that may approach delta- functions. 

These options can be used, for instance, for energy trading with u(t) representing the quantity 
of energy purchased at time t for the fixed price K when the market price is above K. In this 
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case, the option represents a modification of the multi-exercise call option with continuously 
distributed payoff time. For this model, the total amount of energy that can be purchased is 
limited per option. Therefore, the option holder may prefer to postpone the purchase if she 
expects better opportunities in future. 

2.1 Option with non-adapted normalized weight 

A possible modification of the option described above is the option with the payoff at time T 

F u = g( r v(t)f(S(t),t)dt) , (2.5) 



\Jo J 

where g : R — > R and / : R x [0, T] — > R are given functions, the process v(t) is such that 

t-T 

v{t)dt = 1. 

o 

The difference with option (|2.2p is that the process v(t) is not assumed to be adapted to the 
filtration Tt generated by the current information flow. It is formed as 

v(t) = T Uit) , (2.6) 
Jo u (s)ds 

where the process u(t) is selected by the option holder dynamically, using the current flow of 
information, i.e., it has to be adapted to the filtration Tt describing this flow. The process u(t) 
can be called weight process, and v(t) can be called normalized wealth process. 

This setting means that the option holder keeps the writer informed about her current 
selection of the value of u(t), and these choices are recorded; the payoff occurs ate terminal time 
T. 

We don't exclude the case when do = and w(t)|tg[o,r) = 0- I n this case, the payoff can be set 
by different ways. A possible way is to define the payoff for u(t) = as F u = g S(t)dtj , 

i.e., as the limit of the payoff (|2.5[) for u(t) = e as e — > 0. Another possible selection of the payoff 
for u(t) = is F u = g (f(S(T),T)) , i.e., as the limit of the payoff (|2.5p for u(t) = eI{ t> T- £ } as 
e — > 0. In this case, v(t) can be interpreted as the delta function with the mass concentrated at 
t = T. 

These options can be useful generalizations of Asian options. Consider, for instance, a 
customer who consumes time variable and random quantity u{t) of energy per time period 
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(t, t + dt), with the price S(t) for a unit. The cumulated number of units consumed up to 
time T is u = u(t)dt; it is unknown at times t < T. To hedge against the price rise, the 
customer would purchase a portfolio of M call options; each option gives the right to purchase 
one energy unit for the price K. To minimize the impact of price fluctuations, the Asian options 
are commonly used. These options can be described as the options with the payoff (S — K) + , 
where S = T -1 J Q T S(t)dt. For accounting and tax purposes, the average price of energy for a 
particular customer has to be calculated 

1 Jo u {t)S(t)dt rather than S. Therefore, 
more certainty in financial and tax situation can be achieved if one uses the portfolio of M 
options with payoff (S u — K) + that is defined by the consumption of the particular customer. 
This is a special case of option (|2.5p . Since u is random and unknown, options (|2.2p cannot be 
used for this model. 

On impact of fixing the cumulated u(t) 

It may appear that options with payoffs (12. 5h are equivalent to the related options (|2.2p . How- 
ever, the nature of control for these options is different. 

First, the selection of u(t) is obviously more restricted for options (|2.2p than for options 
(|2.5p : the option holder have to obey the restrictions on the total amount of cumulated u(t). 
Second, these two types of the options have different opportunities with respect to possibility to 
correct past decisions. Consider a model where the option holder selects u(t) with the purpose 
to maximize the payoff F u . For the holders of options (12. 5p . it is possible to smooth the effect 
of unfortunate decisions made at previous times by selecting larger u(t) at future times. In 
addition, the relative weight of the past good decisions can be enlarged via selecting small 
current u(t). This opportunity is absent for options (12. 2|) . 

3 Market model 

We investigate pricing of the options described above for the simplest case of Black-Scholes 
model, i.e, for a complete continuous time diffusion market model with constant volatility. We 
consider the model of a securities market consisting of a risk free bond or bank account with the 
price B(t) and a risky stock with the price S(t), t G [0, T], where T > be given terminal time. 
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Ut to the end of this paper, we assume that the prices of the stocks evolves as 

dS(t) = S(t) (a(t)dt + adw(t)) , (3.1) 

where a(t) is an appreciation rate, a > is a volatility coefficient. 

In (|3.ip . w(-) is a standard Wiener process on a given standard probability space (Q, T, P), 
where = {lo} is a set of elementary events, J 7 is a complete cr-algebra of events, and P is a 
probability measure. 

The price of the bond evolves as 

B(t) = e rt B(0). (3.2) 

We assume that a > 0, r > 0, B(0) > 0, and S(0) > 0, are given constants. 

Let Tt be the filtration generated by w(t). For simplicity, we assume that a(t) is a bounded 
process progressively measurable with respect to Tt- In this case, Tt is also the filtration 
generated by S(t). 

Let P* be the probability measure such that the process e~ rt S(t) is a martingale under P* 
on [0, T]. By the assumptions for (a, a, r), this measure exists and it is unique. Let E* be the 
corresponding expectation. Under the risk neutral measure P*, 

w*(t) = w(t) + / a^ 1 [a(s) — r]ds 
Jo 

is a Wiener process, and the process S(t) is a martingale, since a(t)dt + adw(t) = adw*(t) and 
dS(t) = odw*(t). 

Admissible portfolio strategies 

Let X(0) > be the initial wealth at time t = and let X{t) be the wealth at time t > 0. We 
assume that the wealth X(t) at time t £ [0,T] is 

X{t) = P{t)B(t) + -y(t)S(t). (3.3) 

Here /3(t) is the quantity of the bond portfolio, 7(f) is the quantity of the stock portfolio, t > 0. 
The pair (/?(■)) t(")) describes the state of the bond-stocks securities portfolio at time t. Each 
of these pairs is called a strategy. 

The process X(t) = e~ rt X(t) is said to be the discounted wealth, and the process S(t) = 
e~ rt S(t) is said to be the discounted stock price. 
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Definition 3.1 A pair (/?(•), 7(-)) * s sa ^ t° be an admissible strategy if f3(t) andj(t) are random 
processes which are progressively measurable with respect to the filtration J- t and such that there 
exists a sequence of Markov times {7fc}£^ with respect to the filtration Tt such that Tp. — > T — 
a.s. and 

E I k (f3(t) 2 B(t) 2 + S(t) 2 -f{t) 2 ) dt < +00 VA: = 1, 2, ... 
Jo 

Definition 3.2 A pair (/?(•), 7(-)) is said to be an admissible self-financing strategy, if 

dX{t) = p(t)dB(t) + y(t)dS(t). (3.4) 

It is well known that (|3.4j) is equivalent to 

dX(t) = j(t)dS(t). (3.5) 

It follows that X(t) is a martingale with respect to the probability measure P*. 

Let X(0) be an initial wealth, and let X(t) be the discounted wealth generated by an ad- 
missible self-financing strategy (f3(-),j(-)). For any Markov time r such that r E [0, T], we 
have 

E*X(r) = X(0) + E* / j(t)dS(t) = X(0) + E* / -/(^Sit^dw^t) = X(0). 
Jo Jo 

4 The fair price of a general controlled option 

Let us consider a controlled option (|2.ip with the payoff F u = $(«(■), £(■)), where :WxC(0,T) 
is a measurable mapping such that sup ugW E*|F M | < +00. Here U is the set of all admissible 
controls u(-). All examples considered above are covered by this general setting. 

Definition 4.1 The fair price of an option is the price c such that 

• The option writer cannot fulfill option obligations at terminal time T using the wealth 
raised from the initial wealth X(0) < c with self-financing strategies. 

• A rational option buyer would't buy an option for a higher price than c. 

The following theorem is formulated for the case of constant r. However, this theorem holds for 
any model where the risk-neutral measure P* exists and is unique; the extension on the case of 
time variable r = r(t) is straightforward. 
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Theorem 4.1 The fair price cp of an option with the payoff F u is 

cf = e~ rT sup E*F U . 
Proofs are given in the Appendix. 

5 Pricing of options with adapted weight 

Consider an option with payoff 

F u = g( C u(t)f(S(t),t)dt] , (5.1) 



\J0 / 

where /(x, t) : (0, +oo) x [0, T] — > R and g(x) : (0, +oo) — > R are given continuous non-negative 
functions such that \ f(x,t)\ + \g(x)\ < const (\x\ + 1) and \df(x,t)/dx\ + \dg(x)/dx\ < const . In 
addition, we assume that the function g(x) is non-decreasing. 

The function u(t) is the control process that is selected by the option holder. 

We assume that S(t) and T t are such as described in section [3j 

Let U be the class of processes u(t) consisting of the processes that are adapted to the 
filtration Ft and such that 

u{t) G [d ,dx], (5.2) 

where < do < d\ < +oo. 

We consider the class U\ of admissible processes u(t) consisting of the processes u €U such 
that 

I u{t)dt = 1. (5.3) 
Jo 

To ensure that the set of admissible strategies in non-empty, we assume that doT < 1. 
By Theorem 14. 1\ the fair price of this option is 

c F = e~ rT sup ~E*F U . (5.4) 

<)e£/i 



Lemma 5.1 Assume that the function g is concave on (0, +oo). In this case, an optimal control 
for problem fl5.^|) exists inU\. 
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5.1 Pricing via dynamic programming 

It follows from the definitions that the price cf for this option can be found via solution of 
optimal stopping problem 

Maximize E*g(a;(r)) over u(-) € U, 
subject to dx(t) = u(t)f(S(t),t)dt, 
dy(t) = u(t)dt, 

dS(t) = rS(t)dt + aS{t)dw* (t) , (5.5) 

where r = T A inf {t G [0, T] : y(t) > 1}. In this case, cp = e~ rT sup u ^ eU E*<7(x(r)) given that 
x(0) = 0, y(0) = 0, 5(0) = S . 

Alternatively, the price cf for this option can be found via solution of optimal stopping 
stochastic control problem 

Maximize E*g(x(T)) over n(-) &U, 
subject to dx(t) = Ij y f t \ < iyu(t)f(S(t),t)dt, 
dy(t) = u{t)dt, 

dS{t) = rS(t)dt + aS(t)dw*(t). (5.6) 

In this case, cf = e~ rT sup u (.) eW E*g(x(T)) given that x(0) = 0, y(0) = 0, 5(0) = So- 

Problem (15.5P and (15.6|) are such that the matrix of the diffusion coefficients for the state 
process is degenerate. In addition, problem (|5.5[) involves first exit from a domain with a 
boundary, This makes it difficult to use classical methods of solution. Hence it will be more 
convenient to use (15. 6p that does not feature a boundary and first exit time. 

The state equation for problem ()5.6[) has discontinuous drift coefficient for x(t). To remove 
this feature, we approximate the problem as the following. 

Let functions (f> e (x, t) : (0, +oo) — > R be such as described in Section El e > 0. Let functions 
g e (x) : R — > R and £ e (y) : R — > [0, 1] be selected such that the following holds. 

(i) The functions £ e are non-increasing continuously differentiable and such that £ e (y) = 1 for 
y <T - e, and £ £ (y) = for y > 1 - e + e 2 . 
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(ii) The functions g £ (x) are bounded and twice differentiable. The corresponding derivatives 
are bounded, and 

g £ (x) — > g(x) as e — > 0, ge(x) < <?(x) for all x. 



Let f £ (u,x,y,s) = u£ £ (y)(f) £ (x, t). 

Consider the stochastic control following problem: 

Maximize E*g £ (:r(T)) over u(-) eU, 
subject to dx(t) = f £ (y(t),u(t), S(t),t)dt, 
dy(t) = u(t)dt, 

dS(t) = rS(t)dt + aS(t)dw*(t). (5.7) 

Consider the corresponding value function 

J e (x,y,s,t)= sup Ejg £ (x £ {T)) x (t) = x,y(t) = y,S(t) = s}. (5.8) 
u {-)eu L J 

Let D = RxRxRx [0,T]. Let X be the class of functions v(x,y,z,t) : D — > R such 
that v is continuous and there exists c > such that v(x,y, z,t) < c(\x\ + \y\ + \z\ + 1) for all 
(x, y, z, t) G D. Let X\ be the class of functions v £ X such v' x , v' y , and v' z belong to X . Let X 2 
be the class of functions v £ X\ such that w£ and v" z belong to X. 

Theorem 5.1 (i) The option price can be found as 

c F = lim e~ rT J e (0, 0, 5(0), 0). (5.9) 

e^O 

(ii) The value function J = J £ satisfies the Bellman equation 



J t + max {J'J £ + J'u} + J' s rs + ±j' s ' s cr 2 s 2 = 0, 

«e[do,di] 

J{x,y,s,T) = g £ (x). (5.10) 



TTie Bellman equation has unique solution in the class of functions J = J E (x,y, s,t) such 
that J £ (x, y, s, t) = V £ (x, y, log s, t) for some function V £ £ X 2 . The Bellman equation holds 
as an equality that is satisfied for a.e. (x, y, s, t) G R x R x (0, +oo) x [0, T\. 
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5.2 Case of linear g 

The dimension of the Bellman equation can be reduced for the case when g(x) = x. In this case, 
the option price cf can be found via solution of optimal stopping problem 

Maximize E* f u(t)f(S(t),t)dt, over u(-) eW, 
Jo 

subject to dy(t) = u(t)dt, 

dS(t) = rS{t)dt + aS{t)dw*(t), (5.11) 

where r = T A inf{i G [0, T] : y(t) > 1}. In this case, c F = e~ rT sup M( . )gW E* J Q T u(t)f(S(t), )dt 
given that y(0) = 0, 5(0) = Sq. Consider the corresponding value function 

x,t 

J(y, s, t) 4 sup E J f U u(s)f(S(s), s)ds y(t) = y, S(t) = s\. (5.12) 

Here 

tI' s = T A inf {6 € [t, T] : y + / u(g)dg > 1}. 

J s 

The option price is cp = e~ rT J(0, 5(0), 0). The Bellman equation satisfied formally by J is 

f// 2 2 

J(l,s,T) = 0, J(y,s,T) = 0. (5.13) 



J t + max { J'u + u/(s, t)} + J' s rs + ±J" s <J 2 s 2 = 0, 



The Bellman equation holds for x > 0, y < 1, s > 0, t < T. However, to derive this equation 
and prove Verification Theorem, one have to overcome again some technical difficulties arising 
from the presence of boundary and from the fact that the diffusion in the state equation is 
degenerate. Instead, we suggest to use an alternative stochastic control problem 

r 

Maximize E* / l^ y ^^u(t)f(S(t),t)dt, over u(-) 
Jo 

subject to dy(t) = u(t)dt, 

dS(t) = rS(t)dt + aS{t)dw*(t). 

This problem without does not involve first exit time. The Bellman equation satisfied formally 
by its value function J = J(y, s, t) is 

J t + max \J' x uf + J'u + l{ y <i\uf} + J' s rs + \ J' s ' s a 2 s 2 = 0, 

u£[d ,di] 

J(y,s,T) = Q. 
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The question is degenerate again, so we will use an equation with more regular coefficients as 
an approximation. 

Let functions f £ be such as described above. Consider stochastic control problem 

f T 

Maximize E* / f e (u(t),y(t), S(t),t)dt, over u(-) &U, 

Jo 

subject to dy(t) = u(t)dt, 

dS{t) = rS(t)dt + oS(t)dw*(t). (5.14) 

Consider the corresponding value function 

J £ (y,s,t) = sup E*{ f i £ (u(t),y(t),S(t))dt y(t) = y, S(t) = s\. (5.15) 

Let D 1 = R x R x [0, T]. Let y be the class of functions v (y, z,t) : D' — > R such that v is 
continuous and there exists c > such that v(y,z,t) < c(\y\ + \z\ + 1) for all (y,z,t) £ D' . Let 
y± be the class of functions v € y such v' y and v' z belong to y. Let 3^2 be the class of functions 
v G such v' t and v" zz both belong to y . 

Theorem 5.2 (i) The option price can be found as 

c F = lim e~ rT JJO, S(0), 0). (5.16) 

e— >0 



(ii) The value function J = J £ (y, s, t) for problem |5. 14\ ) satisfies the Bellman equat 



ion 

Jt + max { j'u + f e } + J' s rs + ±j'J s a 2 s 2 = 0, 

u£[do,di] 

J(y,s,T) = 0. (5.17) 

The Bellman equation has unique solution in the class of functions J = J e (x,y, s,t) such 
that J £ (y,s,t) = V £ (y,logs,t) for some function V £ G The Bellman equation holds as 
an equality that is satisfied for a.e. (y,s,t) G R x (0, +oo) x [0,T]. 

5.3 Analog of Merton Theorem 

In this section, we consider again a risky asset with the price S(t), where t G [0, T]. Consider 
an option with the payoff at time T 

rT 

u(t)f(S(t),t)dt, 
13 



where / : (0, +00) x [0,T] — > R is a given function such that \f(x,t)\ < const (1 + \x\) and 
f(x,t) > 0. Here u(t) is the control process that is selected by the option holder. The set IA\ of 
admissible processes u(t) consists of the processes that are adapted to the current information 
flow (or to the filtration, generated by S(t) and such that 



where L 6 (0, +00) is given. 

If TL < 1 then the optimal solution is u = L. Hence we assume that T > L . 

This option represents the limit version of multi-exercise options when the distribution of 
exercise times approaching a continuous distribution. This model can be used, for instance, for 
energy trading with u(t) representing the quantity of energy purchased at time t for the fixed 
price K when the market price is above K. The total amount u{t)dt of energy that can be 
purchased is limited per option. 

Merton theorem states that American and European options with the same parameters have 
the same price and that early exercise is not rational. The following theorem represents a 
extension of this theorem on the case of controlled options. 

Theorem 5.3 Let f(S(t),t) = e r ^ T ~^h(S(t)), where the function h(x) is convex and non-linear 
in x > 0, and such that at least one of the following conditions holds: 

(i) the function a _1 /i(ax) is non- decreasing in a € (0,1]; or 

(ii) r = 0. 

Then sup u (.) eWl E^i 7 ^ is achieved for the control process 



Remark 5.1 The function h(x) = (x — K) + is such that assumption (i) of Theorem 15.31 are 
satisfied (this function corresponds to the call option with continuously distributed payoff time) . 




u(t) = < 



L, t>T-l/L 



0, t<T-l/L, 



and the price of the option is 
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However, assumptions (i) of Theorem 15 .31 are not satisfied for h(x) = (K — x) + that corresponds 
to the put option. The pricing for this case with r > is an interesting problem. A solution could 
be a useful approximation of the classical optimal stopping pricing rule for American option: for 
the controlled option with restriction that u(t) € [0, L], the limit case when L — > +oo will lead 
to a Stefan problem and optimal stopping. The approximate solution for finite L may be easier 
to find since it does not require optimal stopping and solution of Stefan problem. We leave it 
for future research. 

6 Pricing for non-adapted normalized weight v (t) 

Consider an option with payoff (|2.5p . where functions f(x,t) and g(x) have the same properties 
as in Section [5j 

The function u(t) is the control process that is selected by the option holder. The set of 
admissible processes U is the set of J^-adapted processes u{t) that take values in [do,di], where 
< d < d 1 < +oo. 

We don't exclude the case when do = and w(£)|t g [o,T) = 0- I n this case, the payoff is 
assumed to be 

F u = g(f(S(T),T)), (6.1) 

i.e., as the limit as e — > of the payoffs ()2.5[) defined for u £ (t) = diI{ t>T _ £ y. 

For e > 0, let functions f £ (u,x,t) : (0, +oo) — > R and g £ (x,y) : R 2 — > R be selected such 
that the following holds. 

(i) f £ (u,x,t) = h £ (u,t)4> £ (x,t), where h £ : [d ,di} x [0,T] ->■ R and 4> e : (0,+oo) x [0, T] — > R 
are measurable functions with the following properties. 

(a) The functions <f> £ (z,t) = (f> £ (e z ,t) are bounded and continuously differentiable in 
(z,t) € R x (0,T). The corresponding derivatives are bounded. 

(b) h £ (u,t) = u{l — ip £ (t)) + ditp £ (t), where ip £ (t) : R — > [0,1] is a continuously 
differentiable non-decreasing function such that ip £ (t) = for t < T — e, ip £ (t) = d\, 
t > T -e + e 2 . 

(ii) <f) £ (x,t) < f(x,t), g £ {x,y) < g{x/y) for all x,y ^ 0,t. 
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(iii) The functions g £ (x,y) are bounded and twice differentiable in (x,y). The corresponding 
derivatives are bounded. 

(iv) (a) <f> £ (x,t) — > f{x,t) as e — > for all x,t. 

(b) If y ^ then g £ (x,y) — > g(x/y) as e — > for all x. 

(c) If, for some c € R, we have that e — > 0, y — >• 0, x/y — >• c, then g e (x, y) — > 5(c). 
Consider optimal stochastic control problem 

Maximize E*g e (x(T), y{T)) over u(-) eW, 
subject to cfcc(i) = f £ (u(t), S(t),t)dt, 
dy(t) = h £ (u(t),t)dt, 

dS(t) = rS(t)dt + aS(t)dw*(t). (6.2) 

For u £U, set 

J e (u,x,y,s,t) = E*{g e (x(T),y(T)) x(t) = x, y{t) = y, S(t) = s\. 
Consider the corresponding value function 

J £ (x,y,s,t) = sup J £ (u,x,y,s,t). (6.3) 

Let be the space introduced in Section [5j 
Theorem 6.1 (i) The option price can be found as 

c F = lim e~ rT J £ (0, 0, 5(0) , 0) . (6.4) 

(ii) The value function J = J £ satisfies the Bellman equation 

J t + max {J'J £ + J'h £ } + J'rs+ U" s a 2 s 2 = 0, 

u£[d ,di] y 

J(x,y,s,T) = g £ (x,y). (6.5) 

The Bellman equation has unique solution in the class of functions J = J £ (x,y, s,t) such 
that J £ (x, y, s, t) = V £ (x, y, log s, t) for some function V £ £ X2. The Bellman equation holds 
as an equality that is satisfied for a.e. (x,y,s,t) G R x R x (0, +00) x [0, T\. 
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Appendix: Proofs 

Proof of Theorem \4-l\ It is known that the discounted wealth is a martingale under P* for any 
admissible strategies. For a given it(-), the ability to fulfill the option obligations means that 
X(T) > F u for any u(-) i.e., X(T) > e- rT F u . Hence X(0) = E*X(T) > e~ rT F u . It follows that 

cp > e~ rT sup ~Ei*F u . 

Further, suppose that there exists e > such that, for all u(-) G U, 

c F > e~ rT E*F u + e. 

In this case, for any strategy u(-) G Z//, the claim i 7 ^ can be replicated with the initial wealth 
Xq < cp — e. Therefore, any potential option buyer could save e > quantity of cash if she 
select to replicate the payoff F u with some self-financing strategy. Therefore, a rational option 
buyer would't buy an option for the price cp. This completes the proof. □ 

Proof of Lemma I5.il Let T~i be the Hilbert space formed as the completion of the set of all 
square integrable and adapted processes in the norm of T] x fl). The set IA\ is a convex and 

closed (and, therefore, weakly closed) subset of H. Hence U\ is compact in the weak topology 
of 7~L. Consider the mapping <fi : U\ — > R such that 4>{u) = E*F U . Let {uj} C H be a sequence 
such that 

(j)(uj) — > sup (j)(u) as j — > +00. (A.l) 

There exists a subsequence {uk} and u £ U\ such that — > u G % weakly in % as /c — >• +00. 
By Mazur's Theorem (Theorem 5.1.2 from Yosida (1995)), there exists a sequence of integer 
numbers k = ki — > +00 such that there exist sets of real numbers {a m fc}^ =1 C [0, 1] such that 

ELi = 1 and that 

k 

Uk = ^2 a mk u m u in Ti. (A. 2) 

m=l 

In addition, there exists a subsequence {u m } of this sequence such that u m — >• -u a.e. as £; — ^ +00. 
Consider mappings G(u, y) : x C([0, T] -> R) and J(u, y) : U\ x C([0,T] -> R) such that 
J(u,y) = u(t)f(y(t),t)dt and G(u,y) = g(I(u,y)). Clearly, G(u k ,S(-)) G(u,S(-)) a.s. as 
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k — > +00. In addition, 

|-f(u, 5)1 <T max \f(S(t), t)\ maxu(t) < const • d\T max (Sit) + 1), 
te[o,T] te[o,T] 

|G(u,5)| < const 7(u,5(-)). 

By the Lebesgue's Dominated Convergence Theorem, it follows that (j)(uk) — > </>(tt). 

By linearity of /, we have that I(uk, = El=i amkl( u m, By the concavity of g, it 

follows that 

#(J(«fc, £(•))) > 5 I a mkl(u m , S(-)) I . 

\m=l / 

Hence 

^*(wfc) > CL m f,<fi(u m ) — > sup </>(it) as fc — > +00. 
m=l ueWl 

Hence </>(u) = sup ne -^0(«), i.e., u is a optimal control. □ 

Proof of Theorem \5.1[ Let us prove statement (i). By Lemma [5 -H cf = e~ rT 'Ei*Fqg for some 

u € U\. Let y(t) = J* u(s)ds. By the assumptions on g e , </> e , it follows that e~ rT J £ (0, 0, S(0), 0) < 

cp for any e > and 



J e (0,0,5(0),0) > E*g £ Qf f £ (u(t),y(t),S(t),t)dt 



Moreover, 

g e n f e (u(t),y(t),S(t),t)dt) -»• jfjf 2(t)% (t)<1} /(S(T),t)^J =F S a.s. as e -> 0. 

By the Lebesgue's Dominated Convergence Theorem and by the assumptions on g £ ,</> £ , state- 
ment (i) follows. 

Let us prove statement (ii). Let us consider the change of variables R(t) = lnS'(t). Using the 
Ito formula, we obtain that this change of variables transfers the corresponding control problem 

as 

Maximize E„,g £ (x(T)) over u(-) G U, 
subject to dx{t) = f £ (u(t), y(t), e R( - t] , t)dt, 
dy(t) = u(t)dt, 

dR{t) = (r - a 2 /2)dt + adw*(t). 
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Consider the corresponding value function 



V(x, y, z, t) = sup E 
u{-)eu 



(s(T)) |x(i) = x, y(i) = y, i?(t) = 



Again, the coefficients of this problem are such that the assumptions of Theorem 4.1.4 and 
Theorem 4.4.3 from Krylov (1980), p. 167, 192, are satisfied. 

By Theorem 4.1.4 from Krylov (1980), p. 167, this function satisfy the corresponding parabolic 
Bellman equation that has unique solution V £ X\. The Bellman equation holds in the gen- 
eralized sense, i.e., as an equality of the distributions. This equation includes only one partial 
derivative of the second order, V" z presented with the coefficient a 2 /2 > 0. By Theorem 4.4.3 
from Krylov (1980), p. 192, the derivative V t '(x,y, z,t) belongs to X. It follows that V" z € X. 
The Bellman equation for J(x, y, s, t) = V(x, y, log s, t) is defined by the equation for V with the 
corresponding change of variables. Then the proof of statement (ii) and Theorem 15.11 follows. 



Proof of Theorem 15.21 is similar to the proof of Theorem 15.11 

Proof of Theorem 15.31 First, note that, by Lemma I5.1[ it follows that a optimal control 
u G U\ exists. Consider a linear mapping <p(u) : £Yi — >• R such that (f>(u) = E^F^. 

Let u(-) 7^ u(-) be a process in IA\. Let us show that u(-) cannot be optimal. Since u(-) ^ 
u(-), there exist a equivalent in ^([0, T] x $7) version of u(-), v G [0, L], non-random times 
to G [0, T — L^ 1 ], t\ G (T — L _1 ,T] and a set Qq G J 7 ^, such that P*(f2o) > 0, tf, are Lebesgue 
points for u(t,oj)f(S(t,u),t) for all oj G £lo, and, for small enough e > 0, 

u(to, oj) > v, V(t, oj) = u(t, oj) + u(t — t\ + to, oj) — v G [0, L] Vt G J £ , oj G Oo, 

where J £ = \t\ — e,t\). Note that feasibility of the property that V(t,u) G [0, L) can be seen 
from the existence of to,ti,^o such that ess sup tg/e (u(t, oj) — v) < and esssup tg j r u(t, oj) < L 
for all oj G rig- 
Let I £ = [to, to + £) an d let u e (t) be constructed as the following: 



□ . 



u(t), t£l £ UJ e 



u £ {t) = < 



V 



t G I, 



V(t), t€J e . 



V 
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Let us show that u E (-) € U\ for small enough e > 0. Clearly, this process is adapted to T t . By 
the definition of IA\, 

rto 



and 



pi rto f-i r 

/ u(t)dt = / u{t)dt + / u(t)dt + / u(t)dt = 1 

JO JO Jt! JhUJe 

f-T rto fT r 

/ u £ {t)dt= / u(t)dt+ / + / F(t)<ft 

JO Jo Jt! J J e 

= I u(t)dt+ I u{t)dt + ev+ I u(t)dt+ I u(t - (h - t ))dt - ev 

JO Jt! Jj £ Jj £ 

= [ u(t)dt = 1, 
Jo 

since Jj u(t — t\ + to)dt = Jj u(t — t\ + to)dt = 1. Hence u e (-) € U\ for small enough e > 0. 
To prove that u(-) is not optimal, it suffices to show that 



lim ^ ~ > 0. (A.3) 

e^0+ e 



We have that 

{u £ ) - (j)(u) 



lim ■ 

£->0 + 



lim e 1 E*I n 

£->0 + 



= lim E*In 



f (v-u(t))f(S(t),t)dt + [ (V(t)-u(t))f(S(t),t)dt 

h JJe \ 

1 (u - u(t))f(S(t),t)dt + e- 1 _ u(t))f(S(t), t)dt 



£ 

= E*In [(« - u(t ))f(S(t ), t ) + (u(ti) + «(t ) - « - u(ti))/(S(ti), ti)] 

= EJn [(« - u(t ))f(S(t ), t ) + (u(t ) - v)f(S(t 1 ),t 1 )] 

= E*In E* - u(t ))f(S(t ), to) + (u(t ) - v)f(S(t 1 ),t 1 )\T to } 

= E»In (u(t ) - u) [E„ {/(5(ti),ti)|Jio} - /(5(to),*o)] • (A.4) 

Further, we have that S(t) = S{t)e~ rt is a martingale under P*. We have also that the 
support of the conditional distribution of S(t\) given S(to) is (0, +oo). Since g(-) is convex and 
non-linear, it follows from the Jensen's inequality that 

E4h(S(h))\F t0 } = E^hie^Sih))^} > h(e rt ^S(t )) = h(e r ^-^ S(t )). 

By the properties of h, we have that 

e- rih - to) h{e r(tl - to) S(to)) > h(S(t )). 
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Hence 

h(e r ^- t0 ^S(t )) > e r ^- t0 ^h(S(t )), 

and 

E^hiSih))^} > h(e r ^-^S(t )) > e r ^-^h(S(t )). 

Hence 

EAf(S(ti),h)\F t0 } = e^'^mSih))^} > e^ T -^e r ^-^h(S(t )) = f(S(t ),t ). 

By ()A.4p . we obtain that limit (|A.4p is positive. Hence (|A.3P holds and the proof follows. □ 
Proof of Theorem \6. 1[ Let us prove statement (i). Let {ui} cWbea sequence such that 

E*F ni -> supE*F u as i ->■ +00. (A. 5) 

Let be defined similarly to F u with (/, 5) replaced by (4> £ ,g £ ). Clearly, for any u Eli, there 
exists u £W such that E*F£ = ^(u, 0,0, 5(0), 0). Hence J e (0, 0, 5(0), 0) < sup u6W E*i^. By 
the properties of (f £ ,g £ ), it follows that sup ugW E*F^ < e rT CF- Hence e~ rT J £ (0, 0, 5(0), 0) < cp 
for any e > 0. Moreover, 

J £ (0,0,5(0),0) >E*g £ Q t £ ( Ui {t),S(t),t)dt,^ h E ( Ui {t),t)dt 

Let i be fixed. If Ui(t)dt > then, by the Lebesgue's Dominated Convergence Theorem 
and by the assumptions on g, /, 

& ( [ T uui(t),s(t),t)dt, [ T h £ (u t (t),t)dt) ^J foMtms(nt)dt \ =p ^ 

\Jo Jo J \ J Ui(t)dt J 

a.s. as e — > 0. 

If Ui(t)dt = then, by assumption (|6.1|) . 

g e ^ f £ ( Ul (t),S(t),t)dt, h e ( Ui (t),t)dt 

/ t-T-e+e 2 rT 

= gj h £ ( Ui (t),t)f(S(t),t)dt+ dxf{S{t),t)dt, 

\JT-e JT-e+e 2 

fT-E+E 2 fT 

\ h £ (ui(t),t)dt + / didt 

JT-E JT-e+e 2 



g £ (o(e 2 ) + jf 



T 

d!f(S(t),t)dt, 0(e 2 ) + (e - e 2 )di 
g(f(S{T),T))=F Ui a.s. as e^0 
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again. By the Lebesgue's Dominated Convergence Theorem and by the assumptions on g, f 
again, it follows that 



^(^,0,0, 5(0), 0) ^E*F Ui as e -»■ 0. (A.6) 

We now in the position to prove statement (i). It suffices to show that, for any 5 > 0, there 
exists e* > such that J e (0, 0, 5(0), 0) > sup ueW E*_F u — 5 for e < e*. Let i be such that 
i such that E # i^ > sup ue ^E#.F u — 5/2. By (|A.6j) . there exists £* = e* (<5, i) > such that 
J e (ui,0,0,S(0),0) > E*F U . - 5/2 for all e < ei. Hence J e (0, 0, 5(0), 0) > E*F Ui - 5/2 and 
J e (0, 0, 5(0), 0) > sup ueW E*F u for these e. Then statement (i) follows. 

Let us prove statement (ii). Let us consider the change of variables R(t) = In 5(i). Using the 
Ito formula, we obtain that this change of variables transfers the corresponding control problem 
as 

Maximize E*g e (:r(T), y(T)) over u(-) G W, (A. 7) 

subject to dx{t) = f e (u(t), e R ^ ,t)dt, 
dy(t) = h £ (u(t),t)dt, 

dR{t) = (r - a 2 /2)dt + crdw*(t). (A.8) 

Consider the corresponding value function 

V(x,y,z,t)= sup Ejg £ {x(T),y(T)) x(t) = x, y(t) = y, R(t) = z\ . (A.9) 
u(-)eU 1 } 

Note that the coefficients of this problem are such that the assumptions of Theorem 4.1.4 and 
Theorem 4.4.3 from Krylov (1980), p. 167, 192, are satisfied. The remaining part of the proof 
repeats the proof of Theorem 15. ll fii). This completes the proof of Theorem 16.11 □ 
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